Association vs. Prediction vs. Causation
Association
- Two variables are associated means they are correlated in some way, they are not independent. But we don't know how exactly they affect each other
- Simply conducted multiple regression may only contribute to association
Prediction
- What the outcome will be given the predictor(s). The goal of predictive model is to find the model that minimize the prediction error for future data.
Causation
- What would happen to the outcome when we change the predictor(s). The goal of causal inference is to find the best unbiased estimator of the model parameter.
- Causality subsumes prediction and is more than prediction
From Prediction vs. Causation in Regression Analysis | Statistical Horizons
There are two main uses of multiple regression: prediction and causal analysis. In a prediction study, the goal is to develop a formula for making predictions about the dependent variable, based on the observed values of the independent variables….In a causal analysis, the independent variables are regarded as causes of the dependent variable. The aim of the study is to determine whether a particular independent variable really affects the dependent variable, and to estimate the magnitude of that effect, if any.